\0>AOM  978  FLORIDA  STATE  UNIV  TALLAHASSEE  DEPT  OF  STATISTICS  F/G  12/1 

TESTING  TO  DETERMINE  THE  UNDERLYING  DISTRIBUTION  USING  RANOOMLY--ETC(U) 
APR  78  M HOLLANDER*  F PROSCHAN  AFOSR-76-3109 

JNCLASSIFIED  FSU-STATISTICS-H458  AFOSR-TR-78*0939  NL 


TESTING  TO  DETERMINE  THE  WJDERLYING  DISTRIBUTION 
USING  RANDOMLY  CENSORED  DATA 


by  "^0 

■>  ^ c- 

Flyles  Hollandor  and  Frank  Proschan 

PSU  Statistics  Report  M458 
AFOSRX  Technical  Report  No.  6 
AFOSR  Technical  Report  No.  79 


r'?l,'a3o ; 


April,  1978 

The  Florida  State  University 
Department  of  Statistics 
Tallahassee,  Florida  32306 


Research  sponsored  by  the  Air  Force  Office  of  Scientific  Research,  AFSC,  USAF, 
under  Grants  AFOSR  76-3109  and  AFOSR  74«8D6W.  The  United  States  Government 
is  authorized  to  reproduce  and  distribute  reprints  for  governmental  purposes. 


release: 


Jk  **  public  rel« 

^ [SsliTlinllnn  OnBojlted 


D D C 


JUK  9 19T8 


Testing  to  Determine  the  Underlying  Distribution 
Using  Randomly  Censored  Data 


Myles  Hollander  and  Frank  Proschan 


SUMMARY 


For  right -censored  data,  we  develop  a goodness-of-fit  procedure  for  testing 
whether  the  underlying  distribution  is  a specified  function  G.  Our  test  statistic 

A 

C is  the  one-sample  limit  of  Efron's  (1967)  two-sample  statistic  N.  The  test 
based  on  C is  conqpared  with  recently  proposed  competitors  due  to  Koziol  and 
Green  (1976)  and  Hyde  (1977) . The  comparisons  are  on  the  basis  of  (i)  applicability 
(ii)  the  extent  to  which  the  censoring  distribution  can  affect  the  inference,  and 


(iii)  power.  It  is  shown  that  in  certain  situations  the  C test  compares  favourably 


with  the  tests  of  Koziol -Green  and  Hyde 


Same  Key  Words:  Goodness-of-fit  test;  Kaplan-Meier  estimator;  Right -censored  data 
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1.  INTRODUCTION  ' ' ’ 

In  the  classical  non-censored  one-sample  goodness-of-fit  problem,  one  observes 
a random  sample  from  a population  with  distribution  function 

F(x)  ■ P(X  s x);  the  corresponding  survival  function  is  F(x)  » P(X  > x)  « 1 - F(x). 
The  null  hypothesis  asserts  that  F(x)  > G(x),  where  G is  completely  specified. 

The  need  to  generalize  this  problem  to  encompass  censored  data  arises  because  in 
some  situations,  such  as  clinical  trials,  or  life  testing,  the  X's  may  represent 
times  to  the  occurrence  of  an  end-point  event  and  the  data  are  usually  analyzed 
before  all  patients,  or  items  on  test,  have  experienced  the  event.  In  the  clinical 
trials  context  the  end-point  event  could,  for  example,  be  relapse,  pregnancy,  or 
death.  In  the  life-testing  framework,  the  end-point  event  could  be  failure  of  the 
inner  ring  of  ball  bearings  which  are  on  test.  In  these  cases  the  observations 
can  be  viewed  as  pairs  (Z^,  6^),  i » l,...,n,  where 

Z.  - min(X.,  T.),  (1.1) 

1 if  Zj^  « X^  (i^^  observation  is  uncenscred) , 

(1.2) 

0 if  Zj^  “ Tj^  (i^^  observation  is  censored) , 

where  T^  is  the  time  to  censorship  of  the  i^^  observation.  Here  we  assume  that 

Xi,...,Xn  are  independent  and  identically  distributed  according  to  a continuous 

distribution  F,  T^,...,T^  are  independent  and  identically  distributed  according 

to  a continuous  censoring  distribution  H(x)  « P(T  s x),  and  furthermore  the  T's 
and  the  X's  are  assumed  mutually  independent.  The  censoring  distribution  H is 
typically,  though  not  necessarily,  unknown  and  is  treated  as  a nuisance  parameter. 


^ ' 1 

‘ 
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The  goodness-of-flt  hypothesis  is 

Hq:  FCx)  - G(x).  all  x,  (1.3) 

where  G is  completely  specified.  Of  course,  if  the  censoring  distribution  H is 

such  that  it  prevents  us  from  ''seeing"  F throughout  the  support  of  G,  we  will  be  | 

I 

unable  to  use  the  data  to  test  if  Hq  is  true.  Thus  throughout  we  make  the  practical  1 

assumption  that  support (G)  c support (H). 

In  Section  2 we  introduce  the  C statistic,  which  is  the  one-sample  limit  of 

A 

Efron's  two-sample  statistic  W.  In  Section  3 our  test  of  based  on  C is 

compared  with  competitors  due  to  Koziol  and  Green  (1976)  and  Hyde  (1977),  and  various 
advantages  and  disadvantages  of  the  three  procedures  are  noted.  For  example,  the 
Koziol -Green  test  requires  the  restrictive  assumption  that  the  censoring  distribution 
H and  the  true  life  distribution  P satisfy  the  relationship  given  by  (3.2), 
namely  that  H = for  some  S,  0 < 6 < 2.  However  when  this  condition  is  indeed 
satisfied,  the  Koziol -Green  test  is  consistent  against  a broader  class  of  alternatives 
than  the  C test  or  Hyde's  test.  Hyde^s  test  has  the  disadvantage  that  the  inference 
to  be  drawn  from  the  test  can  be  adversely  affected  by  the  nuisance  parameter  H. 

The  C test  imposes  only  a relatively  mild  restriction  on  H,  namely  that  the 
integral  in  (2.6)  converges.  Furthermore,  the  C statistic  (2.5)  estimates 
-/GdP(»  /FdG),  independently  of  H.  The  C test  is  however  limited  in  the  sense 
that  it  is  unable  to  detect  F alternatives  to  Hq  for  which  /FdG  » 1/2. 

Section  3 also  contains  a Monte  Carlo  power  comparison  of  the  three  competitors 
for  (a)  normal  location  alternatives  to  a hypothesized  standard  normal,  and  (b) 
exponential  scale  alternatives  to  a hypothesized  exponential  with  scale  parameter  1. 

In  this  limited  study,  the  C test  compares  favourably  with  the  tests  of  Koziol -Green 
and  Hyde. 

Section  4 contains  an  application  of  the  three  goodness-of-fit  tests  to  an  updated 


version  of  clinical  trial  data  analyzed  by  Koziol  and  Green  (1976) . 
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2.  THE  ONE-SAMPLE  LIMIT  OF  EFRON'S  TWO-SAMPLE  TEST 

Efron  (1967)  has  considered  the  two-sample  problem  with  right-censored  data. 
In  addition  to  the  Z's  and  5's  defined  by  (1.1)  and  (1.2),  one  observes  the 
analogous  quantities 


Zr  » min(Y^,  Sj^), 


e 


i 


fl  if  21 


’i- 


0 if 


(2.1) 


(2.2) 


where  i « l,...,m,  and  m denotes  the  size  of  sample  2.  Here  Y,,...,Y  are 

1 m 

independent  and  identically  distributed  according  to  an  unknown  distribution  G, 
Sj^,...,S^  are  independent  and  identically  distributed  according  to  I(x)  * P(S  i x), 

and  the  X's,  T's,  Y’s,  and  S's  are  assumed  mutually  independent. 

In  the  two-sample  problem,  the  null  hypothesis  is 

hJ  : F(x)  = G(x),  all  X,  (2.3) 


* 

where  the  hypothesized  common  distribution  is  unspecified.  Efron's  test  of  H^ 
is  based  on  the  statistic 


W - -/g’(x)dF(x) , 


(2.4) 


where  F,  G are  the  Kaplan-Meier  (1958)  estimators  of  F,  IT  respectively.  Letting 
Z(l)  ^ ^(2)  ^ ^ ^(n)  ^^®  ordered  Z's, 


P(x) 


k-1  S. 

• ^n^{(n-j)/(n-j*l)}  X e 


and  F^(x)  ■ 0 for  x > Z 


(n)’ 


Of  course  F is  defined  analogously. 
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Invoking  an  idea  due  to  Moses  (1964) , we  let  the  sample  size  m increase 

♦ * 

without  limit.  Then  the  unknown  G becomes  known,  reduces  to  H^,  and  W 

becomes 

C » -/G(x)d?(x).  (2.5) 

The  asymptotic  mean,  variance  and  distribution  of  C in  this  one-sample 

A 

framework  are  directly  obtainable  from  Efron's  asymptotic  results  concerning  W. 
Equivalently,  asymptotic  properties  of  C can  be  derived  directly  from  the  result 
(cf. Efron  (1967),  Breslow  and  Crowley  (1974),  Meier  (1975))  that  the  stochastic 
process  n^{F(s)  - F(s)}  tends,  as  n-^<»,  to  a Gaussian  process  with  mean  0 and 
covariance  kernel 

_ s _ _ _ 

r(s,  t)  - -F(s)F(t)  / {;;(z)F(z)}‘Mf(z),  s s t, 

•00 

where  ir(z)  ■ F(z)ff(z) . It  follows  from  the  continuous  mapping  theorem  that 
/n*‘{F(x)  - F(x)}dG(x)  -*•  N(0,  o^) , 

where 

“ / / r(s,  t)dG(s)dG(t)  ♦ / / r(s,  t)dG(s)d2’(t) 

sst  s>t 

00  00  S 

» -2  / / / [{F(s)F(t)/K(z)F(z)}dF(z)dG(t)dG(s). 

•00  s -00 

Ef:"?n  shows  that,  under  H^,  o^  reduces  to 

o2  . 4‘1  / z3[K{G"^z)})"^  dz  = 4'^  / z2[H{G"\z)>]”^  dz,  (2.6) 

® 0 0 


and  he  notes  that  the  integral  in  (2.6)  fails  to  converge  if  H(z)  - 0C{G(z)}3j  as 


z 


I,/. 

K"-, 

•A-r-l 


approaches  0.  If  is  finite  and  if  is  a consistent  estimator  of 
then  under 

* def-  Vc  > ’ 

C » n^(C  - h)/a  -*■  N(0,  1). 

Assuming  is  finite,  one  consistent  estimator  of  is 


52  . 4'*  / iHk  CG'*(i))"'<lr, 


where  K^,  the  empirical  survival  function  of  the  Z's,  is 


(2.7) 


IC„Cx) 


(n-i+l)/n,  ^(i-1)  ^ ^ ^(i)* 

" ^n)  ^ 


where  Expression  (2.7)  can  be  simplified  to 


a2  - 16'1  nn/(n-i+l)}C{G(Z^._jj)}‘*  - {(r(Z^. j) }** J. 

* 

Let  X be  distributed  according  to  G and  let  X be  independent  of  X 

* * 

and  have  distribution  G . To  test  versus  one-sided  alternatives  F « G 

* * 

where  P(X  2 X ) < ?s,  we  reject  if  C < -z^,  and  accept  Hq  otherwise.  To 

* * 

test  Hq  versus  one-sided  alternatives  F = G where  P(X  2 X)  > we  reject 

* 

if  C > z , and  accept  H-  otherwise.  Here  z is  the  upper  a percentile 


point  of  a standard  normal  distribution. 

When  there  is  no  censoring,  our  goodness-of-fit  test  based  on  C reduces  to 

Moses'  (1964)  goodness-of-fit  test  based  on  the  one-sample  limit  of  Wilcoxon's 

n _ 

tt.o-sample  statistic.  That  is,  with  no  censoring,  C * ^ G(X.)/n  which  under  H. 

i»l 

is  distribution -free  with  distribution  that  of  the  average  of  n independent  uniform 
random  variables.  The  test  then  refers  (12n)  (C-^s)  to  the  standard  normal  distri- 


bution. 


-fff.f 


K- 


Although  our  continuity  assumptions  preclude  ties,  in  practice  if  censored 
observations  are  tied  with  uncensored  observations,  the  convention  when  forming 


the  list  of  the  treat  the  uncensored  members  of  the  tie  as  being 


less  than  the  censored  members  of  the  tie. 


a 
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3.  COMPARISONS  OF  THE  C TEST,  THE^KOZIO^tGREEN, TEST,  AND  HYDE'S  TEST 

Competing  tests  of  Hq  have  recently  been  proposed  by  Koziol  and  Green  (1976) 
and  Hyde  (1977). 

Koziol-Green  (1976)  test:  Apply  the  probability  integral  transformation  to 
the  Z's  to  form  new  pairs  (V^^,  6^),  where  * min(U^,  L^^),  “ G(Xj^) , 

» G(T^),  and  6^^  is  as  before.  This  reduces  the  problem  to  testing  whether 

the  distribution  of  the  U's  is  uniform  on  (0,  1).  The  Koziol-Green  statistic, 
a generalization  of  the  Cram6r-von  Mises  statistic  to  the  righ" -censored  situation. 


1 . 

» n / {Fjj(t)  - t}2dt. 


(3.1) 


A 

where  Fy  is  the  Kaplan-Meier  estimator  of  the  distribution  of  U. 

Koziol  and  Green  derive  the  asymptotic  distribution  of  under  the  restriction 
that  the  censoring  distribution  H be  related  to  the  survival  distribution  F via 


H = F®, 


(3.2) 


for  some  B,  0 < B < 2.  For  this  model, 

P(«i  = 0)  » / (I  - F®)dF=  B/(B+1), 

so  that  Koziol  and  Green  interpret  B as  the  censoring  parameter.  Koziol  and 
Green's  asymptotic  theory  for  restricts  B to  be  less  than  2,  and  they  give 

asymptotic  critical  points  of  for  the  models  B = 0 (no  censoring)  and 

B “ .5,1,  and  1.5.  Thus  to  implement  the  test,  the  user  must  know  B or 


r 


rW-i 
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estimate  8 from  the  data.  The  estimator  8 used  by  Koziol'and  Green  is  obtained 
by  setting 


0/(6  + 1)  = X (1  - 6J/n  = P-. 
i=l  ^ 


or 


6 - P^./(l  - Pg) 


(3.3) 


If  the  known  or  estimated  value  of  6 is  not  one  of  the  tabled  values,  the  user 
can  choose  between  interpolating  or  deriving  new  asymptotic  percentage  points. 
The  condition  8 < 2 indicates  that  the  Koziol -Green  test  will  be  inappropriate 
when  the  expected  proportion  of  censored  observations  is  2/3  or  more.  Note  that 


under  Hq,  the  variance  expression  (2.6),  for  the  special  case  of  model  (3.2), 


reduces  to 


°n  “ / z^"®dz, 

" 0 


which  will  converge  if  6 < 3.  This  indicates  that  the  C test,  in  the  special 
setting  of  model  (3.2),  can  be  used  when  the  expected  proportion  of  censored  obser- 
vations is  less  than  3/4. 

Hyde  (1977)  test:  Hyde  has  generalized  the  right -censored  model  to  include 
cases  irtiere  subject  i may  enter  the  study  some  time  after  his  lifetime  has 
started . 

Let 


a^  ■ log  (r(vp  - log  G(Zj), 


A ■ X («i  - a.) 
i-1  ^ ^ 


) 


i i 


J 


and  set 


Hyde  shows  that  the  statistic 


• n -iJs 

D » V X a.  N(0,  1). 
i=l 


n ]>s 

where  ^ a.  is,  under  H-,  a consistent  estimator  of  the  standard  deviation 

i-1  ^ " 

\ / 

of  A.  Hyde  makes  the  additional  assumption  that  E(a^)  be  finite,  but  that  this 

condition  is  automatically  satisfied  follows  from  Hyde's  result  that  E(6^  - a^^)  = 0. 

Hyde's  statistic,  when  specialized  to  our  model  by  setting  = 0 for  all  i, 
becomes 


X Ui  > log  G{Zp 


n _ |*s 

■ X log  G(Z.)  . 
i=l 


(3.4) 


If  the  failure  rate  r(x)  * g(x)/G(x)  exists,  where  g(x)  * (d/dx)G(x),  then  D 
can  be  written  as 


XU.  - / r(u)du 
i*l  ^ 0 


n Zi 


X / r(u)du 
i««l  0 


When  D is  significantly  large  (small),  H^,  is  to  be  rejected  in  favour  of 


the  alternative  that  the  true  average  failure  rate  is  larger  (smaller)  than  the 
average  failure  rate  of  the  hypothesized  distribution  G. 


One  disadvantage  of  Hyde's  statistic  is  that  the  nuisance  pardfieter  H can 
affect  the  inference  to  be  drawn.  Consider  the  expectation  of  the  numerator  of 
(3.4).  For  simplicity  we  will  restrict  attention  to  the  numerator  since  the 
denominator,  under  Hq,  is  a consistent  estimator  of  the  standard  deviation  of 
the  numerator.  Furthermore,  for  ease  of  calculation,  suppose  that  G(x)  = exp(-x). 
Then 


ElogGCZ^)  = -Emin(X^.  T^)  = -/F(x)H(x)dx. 


Since 

E6.  - /F(x)dH(x), 

the  mean  of  the  numerator  of  (3.4)  is 

A(F,  H)  » /F(x)dH(x)  - /F(x)H(x)dx.  (3.5) 

Note  also  that  when  F has  a density  f,  we  can  rewrite  (3.5)  as 

A(F,  H)  - /H(x){f(x)  - F(x)}dx.  (3.6) 

Suppose  now  that  F(x)  ■ Fj(x)  = 1 - exp(-x^),  the  Weibull  distribution  with 
shape  parameter  2.  Then 

fj(x)  - Fj(x)  = (2x  - l)exp(-x2), 

so  that  from  (3.6)  we  see  that  A(F^,  H)  can  be  made  negative  by  choosing  an  H 
which  puts  most  of  its  probability  on  (0,  %)  and  analogously  A(Fj,  H)  can  be 
made  positive  by  choosing  an  H which  puts  most  of  its  probability  on  (>$,  <») . 


Hj(x) 


1.98x,  0 i X i H, 

1 - exp(-bx),  H ^ X, 


with  b » -21og. 01,  we  find  A(Fj,  Hj)  « -.16.  Thus  if  is  the  true  censoring 

distribution,  and  the  sample  size  n is  sufficiently  large,  H/de's  test  will  lead 
to  the  decision  that  the  failure  rate  2x  of  is  "larger"  than  the  constant 

failure  rate  1 of  the  hypothesized  G.  However,  for 

.002Sx,  0 s X s 2, 

.005  ♦ .99(x-2),  2 S X S 3, 

1 - exp(-cx),  3 s X, 

with  c * -(1/3) log. 005,  we  find  A(Fj,  H2)  * .11.  Thus  when  is  the  true 

censoring  distribution,  and  n is  sufficiently  large,  Hyde's  test  will  lead  to 
the  decision  that  the  failure  rate  of  F^  is  "smaller"  than  that  of  G. 

An  advantage  of  the  C test  proposed  in  Section  2 is  that  C estimates 
-/G(x)dF(x),  independently  of  the  censoring  distribution  H.  The  analogous  property 
for  W in  the  two  sample  situation  was  a motivating  factor  in  Efron's  development 

A 

of  the  test  based  on  W. 

In  a limited  study  we  have  obtained  Monte  Carlo  power  comparisons  of  the  tests 
based  on  C,  D,  and  Since  the  tests  based  on  these  statistics  are  only  asymp- 

totically exact,  our  study  also  provides  information  about  the  closeness  of  the  true 
levels  to  their  nominal  asymptotically  correct  values.  Though  the  tests  based  on  C 
and  D can  bo  Ouc-oidod  or  two-sided,  the  test  is  inherently  two-sided  and  thus 

only  two-sided  counterparts  based  on  C and  D were  used.  Furthermore,  although 
assumption  (3.2)  is  restrictive  and  not  required  by  the  C or  D tests,  in  fairness 
to  the  test  we  have  sampled  from  situations  where  (3.2)  is  satisfied  and  where 
6 is  one  of  the  values  for  which  asymptotic  percentage  points  of  41^  are  tabled  by 


Koziol  and  Green. 
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[ Table  1 compares  the  power  of  the  tests  based  on  C,  D,  and  for  (a)  G 

I 

^ specified  to  be  standard  normal  and  F taken  to  be  a normal  with  a location  shift, 

and  (b)  G specified  to  be  exponential  with  scale  parameter  1 and  F taken  to 
I be  exponential  with  a different  scale  parameter.  In  each  case  H was  selected 

I so  that  (3.2)  was  satisfied  with  either  6 > .5  or  6 « 1.  The  rough  indications 

I from  Table  1 are  that  for  situation  (a)  C is  to  be  preferred  to  which  in 

turn  is  to  be  preferred  to  0.  For  situation  (b)  C does  best  and  performs 
I better  than  D for  alternatives  close  to  the  null  hypothesis  but  trails  D 

for  alternatives  more  distant  from  Hq. 

I Although  the  values  in  Table  1 are  favourable  to  the  C test,  we  remind  the 

reader  that  it  is  easy  to  exhibit  situations  where  C will  be  inadequate.  For 
example,  under  assiuoption  (3.2)  with  0 < 3 < 2,  the  test  will  be  consistent 
when  G is  specified  to  be  N(0,  1)  and  F is  a scale  alternative,  F ■ N(0,  o^) 
with  * 1.  In  such  a case  the  C test  will  have  power  remaining  approximately 
at  a since  for  such  alternatives  /FdG  * 


I 
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Table  1.  Estiaated powers  of  'C,  D,  and  tests. 

(a)  Hypothesized  normal,  normal  location  alternatives. 


G(x)  - 


Test 

C 

D 

a\e 

0 

.01 

.013 

.018 

.05 

.038 

.061 

.10 

.096 

.114 

o\0 

0 

.01 

.007 

.014 

.05 

.043 

.050 

.10 

.096 

.094 

a\0 

0 

.01 

.018 

.010 

.05 

.044 

.060 

.10 

.086 

.127 

o\6 

0 

.01 

.010 

.013 

.05 

.052 

.057 

.10 

.097 

.108 

, F(x)  » ♦(x-0),  H - (F)® 


^2 

C 

D 

^2 

C 

n « 

20, 

8 » % 

.25 

.011 

.078 

.017 

.040 

.381 

.051 

.199 

.081 

.139 

.590 

.098 

.279 

.167 

.226 

.703 

n ■ 

50, 

0 - Ss 

.25 

.007 

.218 

.097 

.145 

.790 

.052 

.395 

.253 

.316 

.911 

.105 

.521 

.387 

.438 

.947 

n » 

20, 

8 = 1 

.25 

.013 

.056 

.011 

.025 

.346 

.057 

.152 

.058 

.103 

.527 

.103 

.227 

.128 

.163 

.628 

n « 

50, 

8 - 1 

.25 

.017 

.182 

.063 

.097 

.743 

.054 

.353 

.219 

.255 

.884 

.109 

.445 

.328 

.359 

.926 

D 

♦2 

C 

0 

*2 

.5 

.75 

.154 

.230 

.772 

.481 

.596 

.384 

.466 

.928 

.767 

.825 

.525 

.600 

.961 

.876 

.902 

.5 

.576 

.671 

.991 

.75 

.962 

.975 

.792 

.847 

.998 

.992 

.995 

.881 

.908 

1.000 

.997 

.999 

.5 

.75 

.107 

.158 

.726 

.403 

.417 

.329 

.351 

.872 

.688 

.680 

.462 

.453 

.918 

.825 

.798 

.5 

.514 

.521 

.981 

.75 

.921 

.927 

.780 

.762 

.994 

.980 

.978 

.861 

.845 

.997 

.992 

.988 

(b)  Hypothesized  exponential,  exponential  scale  alternatives. 


ff(x) 

- exp(-x). 

F(x)  ■ exp(-ex),  H ■ 

Test 

C 

D 

^2 

C 

D 

*2 

C 

D 

C 

D 

♦2 

n « 

20,  8 

« h 

a\e 

1 

.8 

.6 

.4 

.01 

.019 

.020 

.021 

.074 

.021 

.045 

.320 

.204 

.212 

.791 

.773 

.655 

.05 

.051 

.072 

.060 

.153 

.086 

.124 

.518 

.444 

.415 

.899 

.923 

.851 

.10 

.095 

.129 

.110 

.247 

.160 

.203 

.617 

.577 

.542 

.937 

.963 

.910 

n ■ 

so,  8 

» Jj 

a\e 

1 

.8 

.6 

.4 

.01 

.012 

.011 

.014 

.145 

.075 

.096 

.666 

.620 

.537 

.996 

.996 

.992 

.05 

.044 

.052 

.052 

.312 

.236 

.261 

.827 

.805 

.775 

.999 

.999 

.998 

.10 

.089 

.105 

.099 

.417 

.348 

.360 

.886 

.886 

.845 

.999 

1.000 

.999 

n ■ 

20,  8 

» 1 

a\6 

1 

.8 

.6 

.4 

.01 

.010 

.017 

.011 

.058 

.014 

.036 

.279 

.102 

.123 

.738 

.643 

.520 

.05 

.046 

.059 

.054 

.148 

.076 

.102 

.462 

.312 

.299 

.867 

.835 

.740 

.10 

.092 

.106 

.105 

.200 

.148 

.166 

.569 

.439 

.398 

.917 

.896 

.814 

n ■ 

50,  8 

» 1 

a\6 

1 

.8 

.6 

.4 

.01 

.009 

.017 

.017 

.106 

.040 

.059 

.590 

.440 

.389 

.993 

.990 

.963 

.05 

.045 

.054 

.057 

.240 

.164 

.187 

.766 

.677 

.620 

1.000 

1.000 

.990 

.10 

.093 

.102 

.111 

.335 

.259 

.273 

.834 

.784 

.733 

1.000 

1.000 

.998 

4.  EXA»f>LE 


The  data  in  Table  2,  kindly  furnished  by  Drs.  J.  A.  Koziol  and  S.  B.  Green,  t 

are  an  updated  (March,  1977)  version  of  the  data  set  used  by  Koziol  and  Green  to  jj 

(! 

illustrate  their  goodness-of-fit  test  based  on  The  data  correspond  to  211  i 

■i ' 

state  IV  prostate  cancer  patients  treated  with  estrogen  in  a Veterans  Administration  I 

1 

Cooperative  Urological  Researcii  Group  (1967)  study.  At  tae  •‘terch,  1977  closing  date  - 1 

there  tiere  90  patients  who  died  of  prostate  cancer,  105  who  died  of  other  diseases,  and  | 


16  still  alive.  Those  observations  corresponding  to  deaths  due  to  other  causes  and 
those  corresponding  to  the  16  survivors  are  treated  as  censored  observations  (with- 
drawals). As  reported  by  Koziol  and  Green  (1976),  there  is  a basis  for  suspecting 
that  had  the  patients  not  been  treated  with  estrogen,  their  survival  distribution 
for  deaths  from  cancer  of  the  prostate  would  be  exponential  with  mean  100  months. 

We  thus  applied  the  C,  D,  and  statistics  to  test  that  the  survival  distribution 
is  G^(x)  = exp(-x/100). 

* 

For  the  data  of  Table  2,  C = .69  with  a corresponding  two-sided  P value  of 

.49.  Hyde's  statistic  is  D » -.17  with  a corresponding  two-sided  P value  of  .86. 

The  value  of  the  Koziol-Green  statistic  for  the  data  of  Table  2 is  4*^  « 1.02.  Since 

the  proportion  p of  censored  obseiVations  is  121/211  * .573,  we  find  from  (3.3) 
c 

that  8 ■ 1.34.  Entering  Table  1 of  Koziol  and  Green  (1976)  at  8 3/2  with 

■ 1.02  gives  P ® .14. 

Although  all  P values  are  consistent  with  the  hypothesized  exponential  with 

mean  100,  the  value  of  is  more  suggestive  of  a possible  deviation  from  than 

* 

are  the  values  of  C and  D.  Some  insight  into  this  is  obtained  from  Figure  1 
which  contains  plots  of  the  Kaplan-Meier  estimator  F and  the  hypothesized  survival 
function  G. 

The  visual  indication  from  Figure  1 is  consistent  with  an  underlying  life  distri- 
bution F having  the  property  that  /FdG  is  close  to  Indeed,  the  value  of  C 

for  the  data  of  Table  2 is  .51.  Recall  that  C estimates  /FdG.  Similarly,  Figure  1 
suggests  that  the  average  failure  rates  of  F and  G are  close  and  thus  it  is  not 
surprising  that  Hyde's  statistic  assumes  a value  that  is  close  to  its  null  expected 
value  of  zero.  However,  in  a Cram6r-von  Nises  type  statistic  such  as  the  Koziol-Green 

A 

i|»2,  the  F(x)  - G(x)  differences  are  squared.  Thus  the  negative  deviations,  found 
here  mostly  for  the  middle  month  values,  do  not  "cancel''  the  positive  deviations, 
found  in  the  early  and  late  months  portions  of  the  axis.  This  is  a possible  expla- 
nation for  the  relatively  lower  P value  achieved  by 


- 17  - 

The  value  of  > 1.02  was  cooqnited  for  an  updated  version  of  the  data  used 
by  Koziol  and  Green  (1976).  Nevertheless,  even  allowing  for  updating,  our  value 
of  1.02  is  not  close  to  their  reported  value  of  « .484.  Although  we  are  unable 
to  obtain  the  earlier  data,  we  believe  the  Koziol-Green  value  is  incorrect  and  sus- 
pect that  an  error  in  the  value  reported  by  Koziol  and  Green  nay  have  arisen  through 
their  use,  in  ^pendix  2 of  their  paper,  of  the  sane  synbol  n to  denote  both  the 
fixed  sample  size  and  the  random  number  of  uncensored  observations.  The  possibility 
of  an  error  of  this  nature  has  been  confirmed  by  Drs.  Koziol  and  Green  in  conver- 
sations with  the  authors  of  this  paper. 

Table  2:  Survival  times  and  withdrawal  times  in  months  for  211  patients 
(with  number  of  ties  given  in  parentheses) 

Survival  times:  0(3),  2,  3,  4,  6,  7(2),  8,  9(2),  11(3),  12(3),  15(2),  16(3),  17(2), 

18,  19(2),  20,  21,  22(2),  23,  24,  25(2),  26(3),  27(2),  28(2),  29(2),  30,  31,  32(3), 

33(2),  34,  35,  36,  37(2),  38,  40,  41(2),  42(2),  43,  45(3),  46,  47(2),  48(2),  51, 

53(2),  54(2),  57,  60,  61,  62(2),  67,  69,  87,  97(2),  100,  145,  158. 

Withdrawal  times:  0(6),  1(5),  2(4),  3(3),  4,  6(5),  7(5),  8,  9(2),  10,  11,  12(3), 

13(3),  14(2),  15(2),  16,  17(2),  18(2),  19(3),  21,  23,  25,  27,  28,  31,  32,  34,  35, 

37,  38(4),  39(2),  44(3),  46,  47,  48,  49,  50,  53(2),  55,  56,  59,  61,  62,  65,  66(2), 

72(2),  74,  78,  79,  81,  89,  93,  99,  102,  104(2),  106,  109,  119(2),  125,  127,  129, 

131,  133(2),  135,  136(2),  138,  141,  142,  143,  144,  148,  160,  164(3). 

! 
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